MICROBIAL CELL 
FACTORIES 




Knock-in/Knock-out (KIKO) vectors for rapid 
integration of large DNA sequences, including 
whole metabolic pathways, onto the Escherichia 
coli chromosome at well-characterised loci 

Sabri et al. 




BraMed Central 



Sabri et al. Microbial Cell Factories 2013, 12:60 
http://www.microbialcellfactories.eom/content/12/1/60 



Sabri et a I. Microbial Cell Factories 2013, 12:60 
http://www.nnicrobialcellfactories.conn/content/1 2/1/60 



TECHNICAL NOTES Open Access 



Knock-in/Knock-out (KIKO) vectors for rapid 
integration of large DNA sequences, including 
whole metabolic pathways, onto the Escherichia 
coli chromosome at well-characterised loci 

Suriana Sabri^ Jennifer A Steen^ Mareike Bongers, Lars K Nielsen and Claudia E Vickers"" 




MICROBIAL CELL 
FACTORIES 



Abstract 

Background: Metabolic engineering projects often require integration of nnultiple genes in order to control the 
desired phenotype. However, this often requires iterative rounds of engineering because many current insertion 
approaches are limited by the size of the DNA that can be transferred onto the chromosome. Consequently, 
construction of highly engineered strains is very time-consuming. A lack of well-characterised insertion loci is also 
problematic. 

Results: A series of knock-in/knock-out (KIKO) vectors was constructed for integration of large DNA sequences onto 
the E. coli chromosome at well-defined loci. The KIKO plasmids target three nonessential genes/operons as insertion 
sites: arsB (an arsenite transporter); locZ (3-galactosidase); and rbsA-rbsR (a ribose metabolism operon). Two 
homologous 'arms' target each insertion locus; insertion is mediated by A Red recombinase through these arms. 
Between the arms is a multiple cloning site for the introduction of exogenous sequences and an antibiotic 
resistance marker (either chloramphenicol or kanamycin) for selection of positive recombinants. The resistance 
marker can subsequently be removed by flippase-mediated recombination. The insertion cassette is flanked by 
hairpin loops to isolate it from the effects of external transcription at the integration locus. To characterize each 
target locus, a xylanase reporter gene {xynA) was integrated onto the chromosomes of E. coli strains W and K-12 
using the KIKO vectors. Expression levels varied between loci, with the arsB locus consistently showing the highest 
level of expression. To demonstrate the simultaneous use of all three loci in one strain, xynA, green fluorescent 
protein {gfp) and a sucrose catabolic operon {cscAKB) were introduced into locZ, arsB and rbsAR respectively, and 
shown to be functional. 

Conclusions: The KIKO plasmids are a useful tool for efficient integration of large DNA fragments (including 
multiple genes and pathways) into E coli. Chromosomal insertion provides stable expression without the need for 
continuous antibiotic selection. Three non-essential loci have been characterised as insertion loci; combinatorial 
insertion at all three loci can be performed in one strain. The largest insertion at a single site described here was 
5.4 kb; we have used this method in other studies to insert a total of 7.3 kb at one locus and 1 1.3 kb across two 
loci. These vectors are particularly useful for integration of multigene cassettes for metabolic engineering 
applications. 

Keywords: Chromosomal integration, Homologous recombination, Plasmid, Recombineering, E. coli, Xylanase, 
GFP, CSC genes 
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Background 

Integration of large constructs into the Escherichia coli 
chromosome is desirable both for applied engineering 
projects and for examining metabolic control and other 
aspects of biological pathways. In particular, metabolic 
engineering often requires introduction of many genes to 
achieve production of industrially relevant biochemicals. 
Whole new pathways may be introduced, as well as genes 
to increase flux to metabolic precursors, decrease com- 
petition from competing pathways, and balance cofactor 
and redox requirements [1]. The resulting bioprocesses 
use renewable, sustainable feedstocks such as sucrose 
and glucose [2,3]; ultimately, lignocellulosic feedstocks 
will also become available [4]. Recent advances in 
engineering approaches are allowing for delivery of 
these 'green' replacement products at commercially- 
viable levels [5]. 

In E, coli, foreign genes may be introduced and 
expressed from extra-chromosomal elements or inte- 
grated onto the host chromosome. While relatively large 
fragments can be introduced on plasmids, plasmid-based 
expression has a number of disadvantages. These include 
the metabolic burden placed on the cell for plasmid 
maintenance and expression of gene products from mul- 
tiple copies, the potential for internal rearrangements, 
segregational instability, fluctuations in copy number 
and resulting effects on interpretation of data, and the 
burden placed on cell in presence of antibiotic or other 
selective agents required for plasmid maintenance 
[6-11]. Furthermore, unpredictable pleiotropic effects 
can result from use of plasmids [12], and inclusion of 
antibiotics for plasmid maintenance increases the overall 
bioprocess cost. In particular, high copy-number plasmids 
are known to exert a much stronger metabolic drain than 
low copy- number vectors [13], and for this and associated 
reasons, low copy-number vectors are often found to be 
superior for engineering and other applications [11,14,15]. 

Integration onto the chromosome solves many of the in- 
herent problems associated with plasmid use, and is there- 
fore a preferable approach in many cases. Constructs can 
be stably maintained in the absence of selective pressure, 
the metabolic burden from plasmid maintenance is 
removed, copy number and segregation are stable and 
rearrangements are much less likely to occur. Despite 
the decrease in copy number, productivity can in fact 
be improved by engineering of genes on the chromo- 
some [16,17]. 

Chromosomal integration in E. coli may be achieved 
through use of transposons (either random or site- 
specific; [18,19]) or recombination mediated by phages/ 
phage-derived elements [20-25]. Phages can be used to 
integrate large DNA fragments. However, the integration 
sites are fixed, large regions of non-target DNA may be 
integrated in parallel, and dedicated labs are required for 



phage manipulation due to the risk of contaminating 
phage-free strains. Use of phage-derived elements for 
homologous recombination circumvents several of these 
problems. For example, specifically-designed integrative 
plasmids (CRIM plasmids) carrying phage attachment 
{attP) site can be used to insert large DNA fragments at 
bacterial phage-attachment {attB) sites via in trans 
expression of phage-derived integration {int) and exci- 
sion {xis) genes [21]. An alternative approach that allows 
greater flexibility in the insertion locus is the \ Red re- 
combinase system [25-28]. A PCR-based version of this 
method is widely used for the targeted deletion of 
chromosomal genes in E. coli [23]. This method relies 
on the replacement of a chromosomal sequence with an 
antibiotic marker that is amplified by PCR using primers 
with -50 bp homology extensions to the flanking re- 
gions of the target sequence. The antibiotic resistance 
gene template is a plasmid; flippase recognition target 
(FRT) sites are included in order to remove the anti- 
biotic resistance gene after chromosomal integration, 
allowing recycling of the resistance gene for later engi- 
neering steps. Recombination onto the chromosome is 
mediated by the Red recombinase derived from the \ 
phage. This recombination system consists of three 
genes (y, p, exo), which encode the phage recombinase 
functions as well as an inhibitor of the host RecBCD 
exonuclease V (which normally mediates degradation of 
linear DNA in the cell) [29]. 

The PCR/X Red method can also be used for co- 
integration of relatively short sequences that can be gen- 
erated through PCR, e.g. introduction of point muta- 
tions, promoter replacement, and promoter-less reporter 
genes for promoter tagging experiments [16,30-34]. 
However, the efficiency of integration drops sharply for 
DNA fragments above about - 1500 bp and integration 
of fragments larger than 2500 bp using 50 bp homolo- 
gous arms is very difficult [23,35], though integration of 
fragments up to 3,500 bp has been successful [36-38]. 
Since the resistance gene cassette is 1000-1500 bp (de- 
pending on which resistance gene is used), integration of 
genes of interest is functionally limited to a few genes at a 
time. This necessitates many iterative rounds of engineer- 
ing for introduction of large pathways or multiple genes, 
even when synthetic operons are constructed to minimise 
sequence length. It has been demonstrated that increasing 
the region of homology serves to increase the efficiency of 
integration for large fragments by an order of magnitude 
[24]. However, this quickly becomes impractical using 
PCR approaches as the homology region is conferred by 
the primer sequences, and synthesis of very long primers 
is both expensive and more error-prone. Furthermore, the 
frequency of errors in amplified constructs increases with 
increasing amplicon length, and the yield of amplicons 
decreases with increasing length. 
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More recently, two-step techniques that combine the 
use of the \ Red recombinase with the yeast mitochon- 
drial homing endonuclease I-Scel have been developed 
for introduction of large DNA fragments onto the E. coli 
chromosome [35,39]. I-Scel has an 18-bp recognition 
site which is not found in the E. coli genome; since 
double-stranded DNA breaks stimulate in vivo recombin- 
ation, the efficiency of recombination can be increased by 
several orders of magnitude by using I-Scel sites at the 
target locus [40]. Two approaches have been described. In 
the first method, (p^O-attB sites are introduced onto the 
chromosome, allowing subsequent integration of a CRIM 
plasmid bearing attP sites and the target DNA fragments 
(Dual In/Out strategy; [41]). I-Scel sites are used to im- 
prove cloning efficiency of large DNA fragments into the 
CRIM plasmids prior to integration [39]. This approach 
avoids use of PGR for cloning, and allowed cloning and 
integration of an 8 kb fragment. The second method 
involves first using X Red recombinase to introduce a 
'landing pad' onto the genome that includes a tetracycline 
resistance gene flanked by I-Scel sites and short sequences 
homologous to the ends of the desired integration cassette 
[35]. Once this landing pad is integrated, the cassette 
containing the sequences of interest is introduced by di- 
gestion with plasmid-expressed I-Scel in the presence of X 
Red recombinase. The utility of this method was demon- 
strated by introduction of a 7 kb fragment at six different 
loci, and by combinatorial integration at two different loci 
in one strain. Using these two-step techniques, it is possible 
to insert very large DNA fragments onto the chromosome 
in any desired location. 

Where flexibility of location is less important than the 
ability to quickly and efficiently integrate large DNA 
fragments at well-characterised loci (for example, many 
metabolic engineering projects), a one-step method of 
integration is preferable. A second consideration is the 
lack of availability of well-characterised integration sites 
for insertion of multiple genes. Insertion at different sites 
on the chromosome has a significant effect on expression 
levels; the effect is mediated through interference from 
local sequences, gene orientation, and insertion position 
relative to the chromosomal origin of replication [42]. It is 
therefore of value to know the relative expression levels 
when genes are inserted at different loci. In addition, it is 
necessary to avoid disruption of normal growth. 

Here we present an alternative, one-step method for 
integration of large DNA fragments onto the E. coli 
chromosome. The method relies on a series of knock-in/ 
knock-out (KIKO) vectors which facilitate the integra- 
tion (knock-in) of large DNA sequences onto the E. coli 
chromosome and the parallel interruption (knock-out) 
of native genes at defined, well-characterised loci using 
the X Red system. To increase efficiency of integration, 
we used large homologous regions (~ 500 bp). Three 



non-essential loci were characterised as integration sites; 
combinatorial integration at all three sites in one strain 
is successfully demonstrated. 

Results and discussion 

Selection of chromosomal sites for recombination 

The target loci used for development of the KIKO vec- 
tors were selected with laboratory/industrial growth in 
mind. Two of the loci are involved in PTS uptake of 
compounds typically not used in laboratory or industrial 
growth conditions: the arsenite transporter arsB [43], 
and the ribose transporter operon rbsA-rbsR [44]. The 
third locus was the well-characterised lactose catabolic 
enzyme gene p-galactosidase, lacZ, It has been previously 
demonstrated that interruption of these ORFs is not dele- 
terious for E. coli growth or cell division under standard 
cultivation conditions [43,45]. To ensure broader appli- 
cation to different E. coli strains, these loci were also 
screened and found to be conserved across a large cross- 
section of E. coli strains, including B, C, W, and K-12 as 
well as many pathogenic strains (data not shown). The rbs 
operon {rbsABCKR) is incomplete and non-functional in 
E, coli W, an industrially relevant sucrose utilizing strain 
[2], however the target sequences for homologous recom- 
bination are conserved. The lacZ locus is often subjected 
to modification in laboratory strains, and has been suc- 
cessfully used as a site for integration of sucrose utilization 
genes in E. coli K-12, C and B [46]. Furthermore, when 
using lacZ as a target locus, blue/white screening can be 
used to facilitate identification of insertion mutants. The 
positions of all three loci on the E, coli MG1655 genome 
are shown in Figure 1. 




Figure 1 Location of KIKO target loci on the genome of E coli 
IVIG1655. Map generated using the Microbial Genome Viewer 
(Iittp://mgv2.cnnbi.ru.nl; [47]). Locus positions relative to or/C are 
conserved in E. coli W. Arrows represent gene orientation. 

V . ) 



Sabri et a I. Microbial Cell Factories 2013, 12:60 
http://www.nnicrobialcellfactories.conn/content/1 2/1/60 



Page 4 of 14 



Design and construction of the KIKO vectors 

The generic features of the KIKO vectors are shown in 
Figure 2A. They share a common backbone encoding 
the R6K ^/r^-dependent origin of replication (on) and 
an ampicillin antibiotic resistance cassette. The R6K ori 
requires the n protein, which is supplied in trans by the 
\pir gene [48,49]. Use of the R6K ori ensures that the 
plasmid cannot be maintained in target strains lacking 
\pir; therefore, antibiotic resistance should not occur in 
the absence of integration. Homologous arms of -500 
bp each for each target locus are included in the KIKO 
vectors; this large amount of homology serves to in- 
crease recombination efficiency [24]. Embedded between 
the homologous arms is a multiple cloning site (to facili- 
tate the insertion of genes of interest) as well as an anti- 
biotic resistance gene cassette (chloramphenicol or 
kanamycin). The antibiotic resistance cassette is flanked 
by flippase recombination target (FRT) sites to allow for 
the removal of the antibiotic resistance cassette after 
insertion of the construct onto the chromosome so that 
these markers maybe reused as necessary [50,51]. There 
are three main influencers on gene expression for 
inserted DNA: local sequence context, gene location, 
and orientation [42]. Of these, the effect of local se- 
quence context is the most pronounced when expres- 
sion is controlled by constitutive promoters and the use 
of transcriptional terminators is absent [42]. Therefore, 



the entire insertion construct was flanked with sequences 
encoding hairpin loops [52] to act as transcriptional termi- 
nators. These repeats effectively isolate the integrated 
genes from the influence of transcriptional events at adja- 
cent loci [42]. The process for one round of integration 
using these vectors is shown in Figure 2B. 

Integration at the target loci does not affect growth rate 

We tested the KIKO vectors using two different E. coli 
strains: K-12 (MG1655) and W. E, coli K-12 (MG1655) 
and its derivatives are widely used in both laboratory 
and industry [53-55] and chromosomal integration using 
the \ Red functions is routine in this strain. E, coli W is 
of interest as it is particularly fast-growing on sucrose 
(1.2 h'^; [56]), an especially attractive renewable carbon 
source for industrial applications [2]. E, coli W also dem- 
onstrates low acetate production enabling cultivation to 
high cell densities and high tolerance to environmental 
stresses (reviewed in [2]). To examine the effect of inte- 
gration at the three selected loci, a FRT:xylanase:FRT 
cassette was inserted at each site in both W and K12 
using the \ Red recombination system [23]. Integration 
did not affect growth rate in either strain for any of the 
loci (Figure 3), indicating that the integration of exogen- 
ous DNA at these loci is not deleterious for growth rate 
or viability under standard laboratory conditions. As we 
have previously observed [46], strain W has a relatively 
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DAY 1 

Clone genes of interest between homologous arms and transform into competent pir+strain 
OR construct insertion cassette by Gibson assembly (go to Day 3 step 2) 
DAY 2 

1 . Test colonies from ligation by colony PGR. Select positives for plasmid preparation and 

sequencing. Streak out to purify colonies 
DAY 3 

1 . Inoculate culture from single colony for plasmid preparation 

2. Inoculate strain of interest harbouring pKD46 for preparation of competent cells 
DAY 4 

1 . Prepare plasmid and excise insertion cassette (Gibson-assembled fragments are ready to 
use). Sequence plasmid for QG 

2. Prepare competent cells using strain of interest (induce recombinase during preparation) 

3. Transform insertion cassette into induced competent cells 
DAYS 

1 . Screen colonies by colony PGR (target both integration junctions) 

2. Select positives and run QG. Streak out to purify. 
DAYS 

Inoculate broths for glycerol stocks and competent cell preparation 
DAY 7 

Prepare competent cells and transform with pGP20 for flippase-mediated removal of resistance 

gene 

DAYS 

Screen for loss of antibiotic resistance cassette (negative selection and colony PGR) 

Figure 2 Generic features of tiie KIKO plasmids and integration protocol. A) Map of the general features of the KIKO vectors, showing the 
location of the antibiotic resistance cassettes (Ap^ ampicillan resistance; Km^ kanamycin resistance; Cm^ chloramphenicol resistance), the flippase 
recombinase target sites (FRT), the hairpin loops (HL), the homologous arms (HAl and HA2) used for recombination onto the E. coll chromosome 
and the unique restriction sites located within the multiple cloning site (MCS). B) Process for the isolation of integration mutants. This represents 
the fastest possible generation protocol by standard cloning; in practice, strain construction is often delayed be a few days for quality control 
(QC). Strain construction can be shortened by two days if insertion cassettes are constructed by Gibson assembly. 
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Figure 3 Specific growth rate (n; hr'"") of engineered strains. The growth rate of E. coli K-12 (MG1655) and W strains harbouring exogenous 
DNA integrated into various different loci was measured on LB medium in the absence of selection to allow comparison with wild type strains. 
Bars are means; errors are standard deviations (n = 3). No significant differences were identified for either strain (1-way ANOVA; p>0.05). 



high growth rate; in this case, on LB medium, W grows 
at 1.2 hr'^ compared to 0.9 hr'^ for K-12(MG1655). The 
average growth rates were slightly lower than values 
reported in the literature for shake flask and bioreactor 
experiments. For example, reported growth rates for 
MG1655 range from 1.2 hr'^ to 2.0 hr"^ [57-61]. These 
variations are due to a number of factors, including vari- 
ations between physiological behaviour of different strain 
stocks [54], the exact sampUng time during the growth 
curve [58,60], and particular growth conditions (e.g. 
pH-controUed bioreactor, shake flask, relative aeration, 
etc.). The lower growth rates observed in the micro titre 
plates are consistent with our previous observations 
[46] and with other literature (e.g. [62]), showing a 
slight growth rate retardation in microtitre plate assay 
compared with shake-flask analysis; this is due to mild 
oxygen limitation [62] . 

Expression strength is variable at different loci 

The xylanase insertion cassette was used to assess the 
difference in expression strength at each locus. Xylanase 
is used industrially to assist in degradation of plant fibre 
in the feed, food processing, pulp and paper industries 
[63]. In addition, xylanase has previously been developed 
as a reporter gene and its reaction properties have been 
well characterised [64]. Xylanase activity varied signifi- 
cantly between different integration sites (Figure 4). 
Xylanase activity also varied between the two strains; how- 
ever, the relative expression strength between the sites was 
consistent between strains: xylanase activity was the 



highest when xynA was integrated at the arsB locus, 
followed by rbsAR and then lacZ {arsB > rbsAR > lacZ), 
Variation was highest in strain K-12(MG1655), where a 
2.3-fold difference was seen between the arsB locus 
and the lacZ locus. Expression levels were also higher 
in K-12(MG1655). 

Expression of the xylanase gene should be isolated 
from the influence of local transcription units due to the 
presence of transcription terminators [42] in our con- 
structs. However, the proximity of the insertion to the 
bi-directional origin of replication {oriC) also influences 
gene expression: due to constant genome replication, 
genes closer to the oriC exist at functionally higher copy 
number [42,65]. Given the locations of our target loci 
relative to oriC (Figure 1), one might therefore expect ex- 
pression strength to fall in the order rbsAR > arsB >lacZ, 
However, gene orientation also influences expression 
levels, with genes on the leading strand being more highly 
expressed [42]. In E, coli, arsB and rbsAR are encoded on 
the leading strand, whereas lacZ is encoded on the lagging 
strand. However, in the case of the xynA insertions in each 
locus, the xynA gene cassette was always cloned in the 
opposite orientation relative to the natural expression of 
the target gene encoding the homologous arms. Thus, 
arsB and rbsAR loci xynA knock-ins are encoded on the 
lagging strand, whereas the lacZ locus xynA knock-in is 
encoded on the leading strand. Clearly, the expression 
level advantage enjoyed by genes encoded on the leading 
strand is insufficient to compensate for relatively greater 
distance from the oriC that lacZ is located (Figure 1). 
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Figure 4 Xylanase assay. Xylanase activity was measured in extracts from E. coll K-12 (MG1655) and W strains liarbouring xynA integrated into 
various different loci. Xylanase is expressed in relative fluorescence units (RFU) per minute per mg protein. Bars represent means; errors are 
standard deviations (n = 3). One-way ANOVA followed by Tukey's HSD post-hoc analysis was used to resolve differences in means; p<0.05, 
p<0.01,***, p<0.001. 



However, since the rbsAR locus is closer to oriC that arsB, 
but arsB consistently showed higher expression levels for 
both strains, other unknown factors must also influence 
gene expression levels from these loci. In such a small 
sample size it is not possible to determine what these fac- 
tors are, and investigation of these phenomena requires 
further research. It is worth noting that expression of 
inserted transgenes at loci is relatively predictable [42] as 
are relative gene expression levels from native genes under 
a wide variety of conditions [66] . This former observation 
is supported by the conservation of rank order and relative 
expression levels between the two strains (Figure 4) and 
suggests that transgenes integrated at these loci will 
behave in a predictable fashion. 

It should be noted that the actual orientation of the gene 
being expressed for any given construct made from the 
KIKO vectors depends on the direction of insertion of the 
gene cassette between the homologous arms, and this may 
affect relative expression strength between the loci. It 
should also be noted that leading strand fidelity is higher 
than lagging strand fidelity during DNA replication [67]; 
this, in addition to the generalised expectation of higher 
expression levels from leading strand-encoded genes [42], 
suggests that integration in leading strand orientation 
would be preferable when designing an experiment. 

Finally, the higher xylanase activity measured in K-12 
relative to W may be due to the relatively higher growth 
rate of W, which can result in dilution of cellular protein 
concentrations [68]. 



Integration of different exogenous sequences at all three 
loci in a single strain 

To demonstrate the functional use of all three integra- 
tion sites simultaneously, a sucrose metabolism operon 
encoded by the chromosomally-encoded sucrose catab- 
olism genes {cscAKB, 3.9 kb) and green fluorescent pro- 
tein gene {gfp, 0.8 kb) were introduced into the MG1655 
strain harbouring the xylanase gene at the lacZ locus to 
produce a triple knock-in strain. Strain K-12(MG1655) 
cannot normally utilize sucrose [2]. Introduction of the 
cscAKB genes on a high copy number plasmid causes se- 
vere phenotypic instability, and we have previously dem- 
onstrated that successful sucrose utilization occurs after 
spontaneous integration of the esc genes the lacZ locus 
(this occurs at low frequency in the absence of the X Red 
recombinase functions) [46]. While cscA alone is suffi- 
cient to confer a sucrose-positive phenotype [69-71], 
cscAKB together are required for maximal growth rates 
on sucrose [72]. Following each integration event the 
chloramphenicol antibiotic resistance gene was removed 
via FLP-mediated recombination so that it could be 
recycled in subsequent rounds of integration. The success- 
ful integration of cscAKB and gfp in strain MG^.lacZvxyn 
ArbsAR::gfjp AarsB::csc was demonstrated by a positive 
xylanase assay (Figure 4), the ability to ferment sucrose 
(Figure 5A) and expression of GFP (Figure 5B). The 
growth rate and xylanase activity in this triple knock-in 
strain were indistinguishable from the single knock-in 
strain carrying only the xylanase (Figure 3, Figure 4), 
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Figure 5 Sucrose utilisation and GFP activity in strain N\GAIacZ:v(yn ArbsARiigfp AarsBv.csc. A) MacConkey agar supplemented with 1% 
sucrose was used to examine sucrose fermentation in wild type MG1655 (left) and engineered strain hAGAIacZvxyn ArbsARv.gfp AarsBwcsc. The 
ability to utilise sucrose is indicated by a pH change resulting from fermentation products; this is evidenced by red colonies on the MacConkey 
sucrose plate. B) Wild type MG1655 exhibits no fluorescence in the presence of UV light (left), whereas the engineered strain MGAIocZr^yn 
ArbsARv.gfp AarsBwcsc fluoresces bright green (right). 



indicating that l<nocl<-in at all three loci has no negative 
effect on normal cell growth under these conditions. 

It should be noted that iterative rounds of knock-in 
can result in unintended effects (including deletions and 
rearrangements) resulting from FRT scars remaining in 
the genome from previous integration events. In other 
experiments, we have observed unintended deletions 
when subsequent secondary DNA constructs are intro- 
duced at the same locus [72]. In the case of the KIKO 
vectors described here, the target loci are well separated 
in the genome (see Figure 1) and recombination between 
them would result in large deletions which would pre- 
sumably be fatal. Strains containing rearrangements and 
deletions can be excluded from analysis by screening 
both integration junctions (as described in the Materials 
and Methods). Use of asymmetric sites for removal of 
flanking markers, such as \ attL/attR [41,73] or the 
mutant FRT [74] sites available in E. coli or mutant loxP 
sites [75,76] currently used in eukaryotic systems, would 
preclude such unintended effects when integrating 
further constructs at the same locus. 

Comparison of linearised plasmids and PGR products as 
integration templates 

While performing the integration experiments described 
above using linearised plasmids, we observed a relatively 
high frequency of single cross-over events, whereby the 
entire plasmid (instead of just the target region between 
the homologous arms) was integrated at the target locus. 
This could be identified by screening at each insertion 
junction (only a single junction could be amplified by 
PGR; see Materials and Methods) and was further 
evidenced by persistent resistance to ampicillin (encoded 
on the plasmid backbone). The resistance gene cassette 
could not be cured from these strains because of an 



inability to select for the pCP20 plasmid [51], which ex- 
presses the flippase required to remove sequences be- 
tween the FRT sites. Moreover, we also observed a high 
rate of false positives which were resistant to ampicillin 
but had a wild type locus. This represented substantial 
background. In order to quantify this background and 
examine if an alternative approach could be used to de- 
crease background, we performed a direct comparison 
to investigate the differing efficiencies of integration be- 
tween linearised plasmids and PCR-amplified fragments 
(encompassing the two homologous arms and the regions 
between the arms; see Figure 2A). 

The number of colonies obtained for each transform- 
ation event was often higher for MG1655 than for W 
(Table 1), an observation at odds with the competency 
of the cells: W was an order of magnitude higher in 
competency than MG1655 (10^ CFU/(ig plasmid DNA 
vs. 10^ CFU/(ig, respectively). We observed this consist- 
ently through a variety of other experiments (data not 
shown), and it means that experiments are generally 
more facile in MG1655. A generalised dilution of cellular 
protein (including the recombinase functions) and of 
transformed DNA fragments as a result of the faster 
growth rate of W (as described above) may also provide 
an explanation for the lower transformation efficiency 
observed. Differences may also be due to variations in 
the activity of the recombinase functions between the 
two strains. Regardless, we anticipate that similar varia- 
tions in transformation and integration efficiency will be 
seen in other strains. 

Transformations using linearised plasmid typically 
produced more colonies than those using PGR product 
(Table 1). The percentage of false positives and single 
crossover events was highly variable in the samples ex- 
amined; however in most cases it was higher in plasmid 
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Table 1 Direct comparison of integration using linearised plasmid and per products 



Locus 


Strain 


Template 


Colonies 


WT locus 


Single X 


Double X 


Background 


orsB 


W 


Plasmid 


38 


33 


2 


3 


92% 






PGR 


2 


0 


2 


0 


100% 




MG1655 


Plasmid 


38 


4 


15 


19 


50% 






PGR 


14 


0 


2 


12 


14% 


\nr7 


\A/ 
VV 


Plasmid 
PGR 


Q 
O 

6 


r 
J 

0 


1 

1 


z 
5 


/DVO 

17% 




MG1655 


Plasmid 


32 


25 


6 


1 


97% 






PGR 


39 


5 


12 


22 


44% 


rbsAR 


W 


Plasmid 


22 


0 


16 


6 


73% 






PGR 


7 


0 


5 


2 


71% 




MG1655 


Plasmid 


8 


0 


8 


0 


100% 






PGR 


15 


0 


14 


1 


93% 



Templates for each transformation (linearised plasmid or PGR) are noted, as well as the number of colonies obtained from each transformation, the number of 
colonies having a wild type locus (no integration), the number of colonies having a single cross-over (Single X; integration of entire plasmid) and the number of 
colonies having a double cross-over (Double X; integration of only the region contained between the homologous arms). Background % is calculated based on 
the number of WT locus + Single X colonies. 



transformations than in PGR transformations. While the 
lower number of colonies meant that PGR transformation 
occasionally did not yield any double crossover events, in 
most cases the background of single cross-over events was 
lower in PGR transformations; furthermore, false positives 
with no insertion at the target locus were very rare. Single 
cross-overs occurred overall at a similar frequency to 
double cross-overs across all experiments. Where single 
cross-over events occurred, the single cross-over consist- 
ently occurred with high preference (80-100% across all 
three loci) on the side homologous with the 3' sequence 
of the gene in the insertion locus (data not shown). We 
have no good explanation for this observation. 

The high incidence of false positives (ampicillin resistant 
colonies with a wild type insertion locus) is surprising. It is 
unlikely to occur from residual uncut plasmid, because 
the plasmid replicates via the R6K ori and so cannot be 
maintained in the absence of a \pir gene [48,49]. There- 
fore, we presume that the plasmid must be integrating 
elsewhere on the chromosome. Spontaneous mutation is 
also a possibility, although rates are too high for it to be 
the only possibility. 

There were no clear differences between the different 
insertion loci, with the exception that false positives 
were never found when integrations were attempted at 
the rbsAR locus. This may be due to the small sample 
size used, or may be due to some unknown feature of 
the rbsAR insertion vector/locus. 

Conclusions 

Incorporation of large DNA fragments onto the E. coli 
genome at precise, well-defined loci is desirable both for 
metabolic engineering applications and for examining 
whole biological pathways. We have constructed a series 



of knock-in/knock-out (KIKO) plasmid vectors designed 
to successfully achieve this in one step. We also address 
the need for multiple reliable insertion sites in one gen- 
ome in order to facilitate complex engineering in E. coli 
and show that all three loci characterised can be used 
for integration in a single strain. The variation in expres- 
sion level at the different loci serves to highlight the 
need for well characterised integration sites. We have 
shown the utility of this system in two laboratory strains, 
and we anticipate that it will be generally applicable 
across a range of strains. 

The KIKO vectors have been used successfully within 
our laboratory for the integration of a variety of genes 
for production of industrially relevant products, inclu- 
ding the polyhydroxybutyrate operon in E, coli W 
(phbCBA -H selection marker, 5.8 kb total) and the 
methylerythritol pyrophosphate isoprenoid pathway in 
both W and K-12 (8 genes across three loci, including 
insert sizes of 7.3 kb at the lacZ locus and 4.0 kb at the 
arsB locus). Synthetic constructs for expressing multiple 
genes are typically synthesized commercially unless they 
are already available. Although the method described 
here involves linearization of the integration vector, we 
subsequently found that integration of the entire plasmid 
can occur at relatively high frequency. This may result 
from poor integration at the second homologous region, 
or from integration of residual non-linearised plasmid 
via host-encoded recombination functions (more likely 
in wild type unmodified strains that retain their native 
recombinase functions). We have found that either 
complete excision of the insert cassette by digestion at 
both ends (data not shown) or PGR amplification of the 
insertion cassette (Table 1), followed by gel excision of 
the fragment (in the former case) and/or Dpnl digestion 
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to remove plasmid DNA (in the latter case), significantly 
increases the frequency of the desired recombination 
event. We have also used Gibson assembly [77] using 
the KIKO vectors as PGR templates to successfully con- 
struct insertion cassettes for integration. This approach 
shortens the strain construction process by two days and 
removes the vagaries of standard cut-and-paste ligation- 
mediated cloning, and is currently our preferred approach 
for production of complex insertion cassettes. Although 
we have not tested them specifically, it is also theoretically 
possible to use the KIKO vectors as templates to produce 
knock-out fragments as described by Datsenko & Wanner 
[23]; the universal priming site P2 is intact, but the PI 
site is truncated by two base pairs, necessitating de- 
sign of a slightly different primer for this application. 
The KIKO plasmids will be available from Addgene 
(http://www.addgene.org/). 

Methods 

Bacterial strains, media and growth conditions 

Bacterial strains are shown in Table 2 and plasmids are 
shown in Table 3. R6K-based vectors were maintained in 
E, coll BW23474 [78]. £. coll W (NGIMB 8666, National 
Gollection of Industrial, Food and Marine Bacteria, 
Aberdeen, UK) and MG1655 (£. coli Genetic Stock 
Gentre, GGSG 7740) were used as targets for genomic 
integration. E. coli strains were grown in LB medium 
[79] supplemented with ampicillin (100 (ig/mL), kanamy- 
cin (50 (ig/mL), and/or chloramphenicol (40 (ig/mL) as 
appropriate, unless otherwise stated. Sucrose utilization 
experiments were performed using MacGonkey agar 



(Difco, BD, North Ryde, NSW, Australia) supplemented 
with 1% sucrose or M9 medium [79] supplemented with 
1 mg/L thiamine and 2% sucrose (M9S). 

Molecular techniques 

Standard molecular cloning procedures [79] were 
employed. Oligonucleotide primers used in this study are 
described in Table 4. PGR products and restriction endo- 
nuclease treated-plasmid DNA were purified using a 
MinElute PGR purification kit and QIAquick gel extrac- 
tion kit (Qiagen, Doncaster, VIG, Australia), respectively. 

Construction of the KIKO vectors 

The KIKO vectors (detailed in Table 5) were constructed 
in two stages. In the first stage, the R6K origin and ampi- 
cillin resistance {bluy Ap^) gene was amplified from pKD4 
[23] using primers JSP64 and JSP65 and ligated to - 1 kb 
fragments amplified from E, coli W of arsB (primers JSPIO 
and JSPll), lacZ (primers JSP55 and JSP56), and prbsAR 
(primers JSP57 and JSP58) to generate the base vectors 
parsB'R6K, placZ-RGK, prbsAR-RSK respectively. A syn- 
thetic cassette that contained a multiple cloning site 
(MGS) and a chloramphenicol resistance gene {catP; Gm^) 
flanked by FRT recombination sites [23] flanked by tran- 
scriptional terminators (hairpin loops) from the omega 
cassette [52] was synthesized in a plasmid backbone 
(DNA 2.0; GA, USA). In the second round of cloning, the 
synthetic cassette was excised from the synthesized 
plasmid using Pmel and subcloned into unique blunt 
sites within the base vectors (arsB, EcoRV; lacZ, EcoRV; 
rbsAR, XmnI) to generate the Gm^ KIKO vectors 



Table 2 Strains used In this study 


Strain 


Properties 


Reference/source 


BW23474 


A(orgF-loc)169, AuidA4::plr-116, recAl, rpoS396(Am), endA9(del-ins)::FRT, rph-1, hsdR514, 
rob- 1 creCSlO 


[78]^ 


MG1655 


X-, rph-1 


CGSC^ 


W\GMrsB:xyn 


MG1655 orsB-.xynAcotP; Cm^ 


Tliis study 


MGArbsARwxyn 


MG1655 rbsAR-.xynAcatP; Cm^ 


Tliis study 


W\GMacZ-:xyn 


MG1655 iQcZ.xynAcQtP] Cm^ 


Tliis study 


MGA/6?cZ::xynFRT 


MG1655 lQcZ:xynA-VK\] Cm^ 


Tliis study 


MGA/6/cZ::xyn ArbsAR::gfp-Crf] 


MG1655 lQcZ:xyr]A-m rbsARwgfpcatP; Cm^ 


Tiiis study 


MGAIacZvxyn ArbsARv.gfp 


MG1655 AlocZ:xynA-m ArbsA-rbsR::gfp-m; Cm^ 


Tliis study 


MGAIacZvxyn ArbsARv.gfp 
AarsB::csc-Cm 


MG1655 AlocZ:xynA-m ArbsA-rbsR::gfp-m AorsBr.cscAKBcotP; Cm^ 


Tliis study 


W\GAlQcZ:xyn ArbsAR.gfp AorsBxsc 


MG1655 AlocZ:xynA-m ArbsA-rbsR::gfp-m AorsB::cscAKB-m; Cm^ 


Tliis study 


W 


Wild type strain 


NCIMB 8666^ 


V\/AarsB:xyn 


W arsB-.xynAcotP; Cm^ 


Tliis study 


WAIacZwxyn 


W locZ-.xynAcotP; Cm^ 


Tliis study 


\NArbsAR:xyn 


W rbsAR-.xynAcotP; Cm^ 


Tliis study 



^Purchased from the Coli Genetic Stock Centre (CGSC). 

^National Collection of Industrial Bacteria, Aberdeen, Scotland. This strain is also archived as ATCC9637 in the American Type Culture Collection. 
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Table 3 Plasmids used in this study 


Plasmid 


Properties 


Reference/ source 


pKD3 


oriR6Kgamma bio cotP 


[23] 


pKD4 


oriR6Kgamma bla aphA 


[23] 


pi\U4D 


repAIOIts & oriRIOI ParaB ^^o, bet, gom ofoCblo 




pCP20 


Flp recombinase expression plasmid 


[51] 


pNPDX2 


PBTac2 (Boehringer) carrying modified xynA cDNA from Neocollimastix patriciorum 


[80] 


pBAVI K-T5-gfp 


pBAVI K-T5 plasmid that carries gfp gene 


[81] 


pCSCX 


pCR2.1 plasmid carrying cscA, cscK and cscB gene from E. coll W 


[46] 


pJASOl 


Synthetic construct. Contains MCS & FRT-catP-FRT cassette flanked by hairpin loops. 


This study 


parsB-RGK 


JSP64/JSP65 PCR product from pKD4 (Pvull; 1,495; R6K or/ and bio) OJSP10/JSP1 1 PCR product 
(Pmel; 1,109; arsfiO 


This study 


p/acZ-R6K 


JSP64/JSP65 PCR product from pKD4 (Pvull; 1,495; R6K ori and bio) OJSP57/JSP58 PCR product 
(rmei, 1 , J 1 o, /ucz ) 


This study 


prOS/in-r\Or\ 


JorD4/JorDJ rLn prOQUCt Trom pr\U4 (rVUII, 1 ,4yj, nOr\ Oil anO OIQ) LiJor j j/Jor JD rLn prOQUCt 

(Pmel; 1,034; rbsAR') 


This study 


pKIKOarsfiCm 


pcirs8-R6K (Pmel; 2,604; linear) OpJASOl (Pmel; 1,143; W\CS/cQtP cassette) 


This study 


pKIKO/acZCm 


p/acZ-R6K (Pmel; 2,813; linear) OpJASOl (Pmel; 1,143; W\CS/cotP cassette) 


This study 


pKIK0r6s/\/?Cm 


pr6s/\/?-R6K (Pmel; 2,529; linear) OpJASOl (Pmel; 1,143; W\CS/cQtP cassette) 


This study 


pKIKOarsfiKm 


pKIKOarsfiCm (Xbal; 2,839; cotP drop-out) OpKD3 (Xbal; 930; ophA) 


This study 


pKIKO/acZKm 


pKIKO/acZCm (Xbal; 2,764; cotP drop out) OpKD3 (Xbal; 930; ophA) 


This study 


pKIK0r6s/\/?Km 


pKIK0r6s/\/?Cm (Xbal; 3,1 11; cotP drop out) OpKD3 (Xbal; 930; ophA) 


This study 


pKIKOarsfiOxynCm 


pKIKOarsfiCm (Pstl/Sall; 3,763) OpNPDX2 (Pstl/Sall; 1,108; xyn/\) 


This study 


pKIKO/c/cZOxynCm 


pKIKO/acZCm (Pstl/Sall; 3,688) OpNPDX2 (Pstl/Sall; 1,108; xyn/\) 


This study 


pKIKOr65/\/?OxynCnn 


pKIK0r6s/\/?Cm (Pstl/Sall; 4,035) OpNPDX2 (Pstl/Sall; 1,1 08; xyn/\) 


This study 


^mOrbsARClgfpCm 


pKIK0r6s/\/?Cm (Sacl/Spel; 4,035) OpBAVI K-T5-gfp {SQc\/Spe\; 1,849; ophA & gfp) 


This study 


pKIKOars80csc/\/(5Cm 


pKIKOarsfiCm (EcoRI; 3,771) OpCSCX (EcoRI; 4,064; cscAKB) 


This study 



pKIKO^rs^Cm [GenBank KC503965] pKIKO/^cZCm 
[GenBank KC503966] and pKIKOrZ?5A7?Cm [GenBank 
KC503967]. To generate KIKO vectors conferring 
kanamycin resistance {aphA; Km^), the catP gene was 
replaced with an aphA-contdiinmg fragment from 
pKD3 [23] using Xbal sites. This produced the Km^ 
KIKO vectors pKIKO^rs^Km [GenBank KC503968], 
pKIKO/^cZKm [GenBank KC503969] and pKIKOKm 
[GenBank KC503970]. These plasmids will be available 
from Addgene (http://www.addgene.org/). 

Construction of xylanase KIKO vectors 

To generate xylanase knock-in constructs, PCR products 
carrying the xylanase gene (xynA) under the control of 
the strong inducible tac promoter were amplified from 
the plasmid pNPDX2 [63] using the primers JSP148 and 
JSP 149, and cloned into each of the Cm^ KIKO vectors 
using PstI and Sail to generate pKlKOarsBClxynCm, 
pKlKOlacZChcynCm and pKlKOrbsARClxynCm, In these 
constructs, the xynA gene cassette is inserted in the 
opposite orientation relative to the natural expression of 



the gene encoding the homologous arms. To generate a 
gfjp knock-in construct, the gfp gene was digested from 
pBAVlK-T5-gfp [81] with Sad and Spel and ligated into 
pKlKOrbsARCm at the cognate sites to produce 
pKlKOrbsARClgfpCm, To generate a sucrose utilization 
gene knock-in construct, the cscAKB operon was ampli- 
fied from pCSCX [46] using primers JSP250 and JSP251. 
The PCR product was digested with Spel, purified, and li- 
gated into Spel-digested pKIKO^^^rs^Cm, generating 
pKlKO arsBD^cscAKB, All ligations were transformed 
into rubidium chloride treated E, coli BW23474 
chemically competent cells (Xpir^; see Table 2). For 
pKlKO rbsARClgfpCm, the transformants were plated on 
LB agar supplemented with chloramphenicol and kana- 
mycin. For pKlKOarsBClcscAKBy the transformants 
were plated on the LB agar supplemented with chloram- 
phenicol; colonies were inoculated in 1 ml of M9S 
broth with chloramphenicol and were grown overnight 
at 37°C to confirm the suc"^ phenotype. All of the puta- 
tive recombinants were further screened using colony 
PCR and confirmed using Sanger sequencing of plasmid 
DNA. Sequencing was performed by Australian Genome 
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Table 4 Primers used in this study 



Primer 



Application 



Sequence (S'-SO 



JSPIO 

JSP11 

JSP22 

JSP55 

JSP56 

JSP57 

JSP58 

JSP64 

JSP65 

JSP123 

JSP124 

JSP125 

JSP126 

JSP129 

JSP130 

JSP148 

JSP149 

cscB_F2 

KO Test 



Amplification of arsB fragment (Pmel) 
Amplification of arsB fragment (Pmel) 
Screening oligo for KIKO vector inserts 
Amplification of /acZ fragment 
Amplification of /acZ fragment (Pmel) 
Amplification of rbsAR fragment (Pmel) 
Amplification of rbsAR fragment (Pmel) 
Amplification of R6K oh and bla (Pvull) 
Amplification of R6K oh and bla (Pvull) 
Confirmation of orsB genomic insertion 
Confirmation of arsB genomic insertion 
Confirmation of rbsAR genomic insertion 
Confirmation of rbsAR genomic insertion 
Confirmation of /acZ genomic insertion 
Confirmation of /acZ genomic insertion 
Amplification of xynA (PstI) 
Amplification of xynA (Sail) 
Screening oligo for cscAKB insertion 
Confirmation of insertion junction at Cm end 



GGCGCAGC TGmAAAC GTCCTGACCATCGTA^GG 

GGGTCAGCTGmAAACGTCCCAAATCGCAGCCAAT 

TOTGCGAAGTGATCTOCG 

GCGAGmAAACCA^CCGTGACGTCTCGT 

GCGAGmAAACAAGACTGTOCCCATCGCGT 

GCGA GmAAAC CAAGCGGCA^GTGTATATC 

GCGA GmAAAC CCCAGTCATCmCGGTO 

TAGGC AGCTGG GAGGATATCAAATGGACC 

CCCCCAGCTGGATGCAGGTGGCAC^KCG 

CAACCTGGCTCGACAAAACT 

GTGTCACAAACAGCACAGGC 

CCGAACTGATGAAAGTGCTC 

GCGTAAATCTAAGCCGAACC 

GTCTGAAmCACCTGAGCG 

TCATACAGAACTGGCGATCG 

AGGA CTGCAG CGGAGCTOTCGACTGCACG 

AAACGCTGACTOACGAGGAGCGGCAGAGG 

ATGGCACTGAATATOCATOAGA 

GGAGTGAATACCACGACGAT 



Restriction enzyme sites Included In primers are underlined, and the enzyme name Is listed In the Application column. 



Research Facility (The University of Queensland, St. Lucia, 
QLD, Australia). 

Strain construction 

Engineered strains are shown in Table 2. Gene knock-in 
was performed by homologous recombination using the 
arabinose-inducible X Red recombination system as de- 
scribed previously [23] with minor modifications also 
described previously [46,72]. KIKO vectors containing 
xylanase genes were linearized with NotI and 250 ng of 
purified linear plasmid was electroporated into arabinose- 
induced W(pKD46) and MG1655(pKD46) electrocom- 
petent cells. In parallel, the integration cassette was also 
PCR-amplified from each KIKO vector, purified and again 
250 ng of product was used to transform cells. The 

Table 5 List of kiko vectors, their characteristics and 
genbank accession numbers 



KIKO vector 


Target locus 


Resistance gene 


GenBank accession 


pKIKOarsfiCm 


orsB 


cotP (Cm^) 


KC503965 


pKIKO/acZCm 


lacZ 


catP (Cm^) 


KC503966 


pKIK0r6s/\/?Cm 


rbsAR 


cotP (Cm^) 


KC503967 


pKIKOarsfiKm 


orsB 


ophA (Km^) 


KC503968 


pKIKO/acZKm 


lacZ 


ophA (Km^) 


KC503969 


pKIK0/'6s/\/?Km 


rbsAR 


ophA (Km^) 


KC503970 



The resistance gene listed Is the gene Inserted onto the chromosome after 
homologous recombination. Km'^ kanamycin resistance, Cm'^ chloramphenicol 
resistance. 



transformation efficiency (competency) of the cells was 
determined using 120 ng of pACYC184 (New England 
Biolabs, Ipswich, MA, USA). Integration of the xylanase 
gene was confirmed by colony PGR using three separate 
PCRs: 1) a xynA-specific primer (JSP148) coupled with a 
chromosome-specific primer (JSP123 for the arsB locus, 
JSP129 for the lacZ locus and JSP125 for the rbsAR 
locus); 2) a catP-specihc primer (KO test) coupled with 
a chromosome-specific primer (JSP124 for the arsB 
locus, JSP130 for the lacZ locus and JSP126 for the 
rbsAR locus); and 3) a PGR to screen across the whole 
insertion locus (JSP123 and JSP124, JSP129 and JSP130 
and JSP125 and JSP126, respectively). The resulting W- 
derived strains were designated W ^arsB::xyn, ^fJ tdacZw 
xyn, WArbsAR::xyn, and the resulting K-12(MG166)- 
derived strains were designated MGAarsBvjcyn, MGArbsAR:: 
xyn, and MGAlacZvxyn, 

To generate a double knock-in with xylanase and GFP, 
the Gm^ resistance gene was first removed from E. coli 
MGAlacZ::xyn by flippase-mediated recombination as 
described previously [23,51]. The resulting strain 
{MGAlacZ::xyn¥RT) was transformed with pKD46 and 
Notl-linearized pKlKOrbsARClgfpCm plasmid DNA was 
integrated into the rbsA-rbsR locus using the method 
described above. Transformants were selected on LB 
agar supplemented with chloramphenicol (25 (ig/ml) 
and kanamycin (25 (ig/ml). Integration of the gfp gene in 
the rbsA-rbsR locus was confirmed by colony PGR using 
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JSP 125 and JSP 126 primers. Expression of the gfjp gene in 
the resulting strain MGAlacZ:vcyn ArbsAR::gfjp--Cm was 
confirmed by colony fluorescence under UV light. The 
Cm^ resistance gene was then eliminated as described 
above to produce strain MGAlacZivcyn ArbsAR::gfp, To 
generate a triple knock-in, the sucrose utilization op- 
eron cscAKB from Notl-linearized pKlKO arsBClcscAKB 
was integrated at the arsB locus using the method 
described above. Transformants were selected on LB 
agar supplemented with 25 (ig/ml chloramphenicol and 
the suc^ phenotype confirmed using M9S medium sup- 
plemented with 25 (ig/ml chloramphenicol. Integration 
of the cscAKB genes was verified by colony PGR using 
the JSP123 and cscB_F2 primers [72]. 

Growth rate analysis 

The growth rate of the recombinant strains were deter- 
mined from 96-well microtitre plate cultures monitored 
using a FLUOStar Omega multi-detection microplate 
reader (BMG Labtech, Offenburg, Germany) as described 
previously [46] with the exception that LB medium was 
used instead of defined medium. 

Xylanase assay 

Strains were cultured in LB to an ODeoo of 0.6-0.8 then 
induced for 3 h with 0.1 mM isopropyl-|3-D-thiogalacto- 
pyranoside (IPTG). The cells were harvested by centrifu- 
gation, washed and resuspended in 1 volume of PBS 
buffer, and lysed with PopGultureTM reagent (Novagen, 
Billerica, USA). The lysed cell culture was clarified by 
centrifugation and the xylanase activity measured using 
an EnzGhek® Ultra Xylanase assay kit (Invitrogen, 
Eugene, USA). Fluorescence was measured on FLUOStar 
Omega multi-detection microplate reader (BMG Labtech, 
Offenburg, Germany) every 30 sec min for 30 min at 
room temperature using excitation at 360 nm and 
emission detection at 460 nm. Gommercial xylanase 
from Thermomyces lanuginosus (Sigma, St Louis, USA) 
was used to develop a standard curve. Protein concen- 
tration was determined by the method of Bradford [82] 
using a BioRad Protein Assay kit (BioRad, Gladesville, 
Australia) and bovine serum albumin as a standard. 
Xylanase activity was expressed as change in relative 
fluorescence over time, adjusted for total protein 
present (ARFU/min/mg protein). 

Statistical analysis 

The xylanase activity and the growth rate data were 
analyzed using analysis of variance (ANOVA). All sta- 
tistical analyses were performed using R software 
(http://www.r-project.org/). 

Abbreviations 

CSC. Chromosomally-encoded sucrose catabolism; Cm'^: Chloramphenicol 
resistance; FRT: Flippase recombinase target sites; GPP: Green fluorescent 



protein; HA; Homologous arm; KIKO: Knock-in/knock-out; Km^; Kanamycin 
resistance; MCS: Multiple cloning site; QC: Quality control; RFU: Relative 
fluorescence units; xyn/\: Xylanase gene. 
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